Ground Delay Program Analytics with Behavioral Cloning and Inverse Reinforcement Learning
نویسندگان
چکیده
We used historical data to build two types of model that predict Ground Delay Program implementation and also produce insights into how and why those implementation decisions are made. More specifically, we built behavioral cloning and inverse reinforcement learning models that predict hourly Ground Delay Program implementation at Newark Liberty International and San Francisco International airports. Data available to the models include actual and scheduled air traffic metrics and observed and forecasted weather conditions. We found that the random forest behavioral cloning models we developed are substantially better at predicting hourly Ground Delay Program implementation for these airports than the inverse reinforcement learning models we developed. However, all of the models struggle to predict the initialization and cancellation of Ground Delay Programs, which are both rare events. We also investigated the structure of the models in order to gain insights into Ground Delay Program implementation behavior. Notably, characteristics of both types of model suggest that GDP implementation decisions are made primarily based on conditions now or conditions anticipated in the next couple of hours.
منابع مشابه
Compatible Reward Inverse Reinforcement Learning
PROBLEM • Inverse Reinforcement Learning (IRL) problem: recover a reward function explaining a set of expert’s demonstrations. • Advantages of IRL over Behavioral Cloning (BC): – Transferability of the reward. • Issues with some IRL methods: – How to build the features for the reward function? – How to select a reward function among all the optimal ones? – What if no access to the environment? ...
متن کاملThe Modulation of Operant Variation by the Probability, Magnitude, and Delay of Reinforcement.
Recent studies have demonstrated that the expectation of reward delivery has an inverse relationship with operant behavioral variation (e.g., Stahlman, Roberts, & Blaisdell, 2010). Research thus far has largely focused on one aspect of reinforcement - the likelihood of food delivery. In two experiments with pigeons, we examined the effect of two other aspects of reinforcement: the magnitude of ...
متن کاملInverse Optimal Control for Humanoid Locomotion
In this paper, we present a method for learning the reward function for humanoid locomotion from motioncaptured demonstrations of human running. We show how an approximate, local inverse optimal control algorithm can be used to learn the reward function for this high dimensional domain, and demonstrate how trajectory optimization can then be used to recreate dynamic, naturalistic running behavi...
متن کاملEfficient Probabilistic Performance Bounds for Inverse Reinforcement Learning
In the field of reinforcement learning there has been recent progress towards safety and high-confidence bounds on policy performance. However, to our knowledge, no methods exist for determining high-confidence safety bounds for a given evaluation policy in the inverse reinforcement learning setting—where the true reward function is unknown and only samples of expert behavior are given. We prop...
متن کاملThe effect of emotional social learning program on improving academic achievement and reducing behavioral disorders in people with Conduct disorder
Introduction: Conduct disorder is one of the major problems in adolescence. The purpose of this study was to determine the effectiveness of social-emotional learning program on improvement of academic achievement and reduction of behavioral disorders among people with behavioral disorders. Methods: The present study was a semi-experimental design included two groups of subjects. A group of 20 p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Aerospace Inf. Sys.
دوره 12 شماره
صفحات -
تاریخ انتشار 2015